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5_( Abstract: 



< 



We consider benchmarked empirical Bayes (EB) estimators under the basic 
area-level model of Fay and Herriot while requiring the standard benchmarking 
'/^ constraint. In this paper we determine the excess mean squared error (MSB) from 

^ constraining the estimates through benchmarking. We show that the increase due 

tXJ to benchmarking is 0{m^^), where m is the number of small areas. Furthermore, 

^>; we find an asymptotically unbiased estimator of this MSB and compare it to the 

second-order approximation of the MSB of the EB estimator or, equivalently, of the 
MSB of the empirical best linear unbiased predictor (EBLUP), that was derived 



C/2 



> 



^ 



by Prasad and Rao (19901. Morever, using methods similar to those of Butar and 
Lahiri (20031, we compute a parametric bootstrap estimator of the MSB of the 
benchmarked EB estimator under the Fay-Herriot model and compare it to the 

(T^ MSB of the benchmarked EB estimator found by a second-order approximation. 

>— ^ Finally, we illustrate our methods using SAIPB data from the U.S. Census Bureau, 



and in a simulation study. 
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^ 1. Introduction 

KA Smah area estimation has become increasingly popular recently due to a 

j^ growing demand for such statistics. It is well known that direct small-area esti- 

mators usually have large standard errors and coefficients of variation. In order 
to produce estimates for these small areas, it is necessary to borrow strength from 
other related areas. Accordingly, model-based estimates often differ widely from 
the direct estimates, especially for areas with small sample sizes. One problem 
that arises in practice is that the model-based estimates do not aggregate to the 
ixLore reliable direct survey estimates. Agreement with the direct estimates is 
often a political necessity to convince legislators of the utility of small area esti- 
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mates. The process of adjusting model-based estimates to correct this problem 
is known as benchmarking. Another key benefit of benchmarking is protection 



against model misspecification as pointed out by You, Rao, and Dick (2004) and 



Datta, Ghosh, Steorts, and Maples (2011). 



In recent years, the literature on benchmarking has grown. Among oth- 



ers, 



Pfeffermann and Barnard (1991); You and Rao (2003); You, Rao, and Dick 



(2004); Pfeffermann and Tiller (2006); and Ugarte, Militino, and Goicoa (2009) 



have made an impact on the continuing development of this field. Specifically, 



Wang, Puller, and Qu (2008) provided a frequentist method wherein an aug- 



mented model was used to construct a best linear unbiased predictor (BLUP) that 



automatically satisfies the benchmarking constraint. In addition, Datta, Ghosh, 



Steorts, and Maples (2011) developed very general benchmarked Bayes estima- 



tors, that covered most of the earlier estimators that were motivated from either a 
frequentist or Bayesian perspective. Specifically, they found benchmarked Bayes 



estimators under the Fay and Herriot ( 1979 ) model 



Due to the fact that they borrow strength, model-based estimates typically 
show a substantial improvement over direct estimates in terms of mean squared 
error (MSE). It is of particular interest to determine how much of this advantage 
is lost by constraining the estimates through benchmarking. The aforementioned 



work of Wang, Fuller, and Qu (2008) and Ugarte, Militino, and Goicoa (2009) 



examined this question through simulation studies but did not derive any prob- 
abilistic results. They showed that the MSE of the benchmarked EB estimator 
was slightly larger than the MSE of the EB estimator for their simulation studies. 
In Section 3, we derive a second-order approximation of the MSE of the bench- 
marked Bayes EB estimator to show that the increase due to benchmarking is 
0{m~^), where m is the number of small areas. 

In this paper, we are concerned with the basic area-level model of [Fay and 



Herriot (1979). We propose benchmarked EB estimators in Section 2. In Section 



3, we derive a second-order asymptotic expansion of the MSE of the benchmarked 
EB estimator. In Section 4, we find an estimator of this MSE and compare it 
to the second-order approximation of the MSE of the EB estimator or, equiv- 



alently, the MSE of the EBLUP, that was derived by Prasad and Rao (1990). 



Finally, in Section 5, using methods similar to those of Butar and Lahiri (2003), 
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we compute a parametric bootstrap estimator of the mean squared error of the 
benchmarked EB estimator under the Fay-Herriot (1979) model and compare it 
to our estimators from Section 2. Section 6 contains an application based on 
Small Area Income and Poverty Estimation Data (SAIPE) from the U.S. Census 
Bureau as well as a simulation study. Some concluding remarks are made in 
Section 7. 

2. Benchmarked Empirical Bayes Estimators 
Consider the area-level random effects model 

9i = 9i + ei, 9i = xfP + Ui, i = l,...,m; (2.1) 

where e, and Ui are mutually independent with Cj ~ N{0, D.i) and ui ~ iV(0, cr^). 
This model was first considered in the context of estimating income for small 



areas (population less than 1000) by Fay and Herriot (1979). In (2.1), the D 



are known as are the p x 1 design vectors Xi. However, the vector of regression 
coefficients /3pxi is unknown. 

When the variance component u^ is known and /3 has a uniform prior on 
W , then the Bayes estimator of 9i is given by 9f = {1 — Bi)9i + Bixf0 where 
Bi = Di{al + Di)-\ P = P{al) = {X'V'^X)-^X'V~^e, and V = B[ag{al + 
Di, . . . , cr^ + Dm)- Suppose now we want to match the weighted average of some 
estimates Si to the weighted average of the direct estimates, which we denote by t. 
We assume for our calculations that t = ^^ Wi9i =: 9w We denote the normalized 
weights by Wi, so that Yli "^i = 1- Under the loss L{9., 5) = ^^ Wi{9i — di)"^, and 



subject to ^^ Wi6i = ^^ Wi9i, the benchmarked Bayes estimator derived in Datta, 



Ghosh, Steorts, and Maples (2011) is 



gBMl ^qB^^q^_qB^^ i = 1, . . . , m; (2.2) 

where 9^ = Ylii^'^i^f • ^"^ more realistic settings, cr^ is unknown. Let Px = 
X(X^X)-iX^, hij = xJiX^Xy^Xj, Ui = ei- xj$, and $ = {X^X)'^X^6. 
In this paper, we consider the simple moment estimator given by o"^ = max{0, cj^} 
where a^ = {m - p)~^ [Yl^i "^1 ~ YlT=i ^iO- ~ ^a)] ■. which is given in 



Prasad 



and Rao (|1990|). Then the benchmarked EB estimator of 9i is 

+ (i-^D, (2.3) 



^EBMl _ qEB I (Q aEB\ 
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where Of^ = (1 - Ri)9i + BixJ${al), Bi = D,{al + A)~\ i = 1, . . . , m. The 
objective of the next two sections wiU be to obtain the MSE of the benchmarked 
EB estimator correct up to 0{m~^) and also to find an estimator of the MSE 
correct to the same order. 
3. Second-Order Approximation to MSE 



Wang et al. (2008) construct a simulation study to compare the MSE of the 



benchmarked EB estimator to the MSE of the EB estimator. In this section, we 
derive a second order expansion for the MSE of the benchmarked Bayes estimator 
under the same regularity conditions and assuming the standard benchmarking 
constraint. That is, for the model proposed in Section 2, we obtain a second- 
order approximation to the MSE of the empirical benchmarked Bayes estimator 
derived in Section 2. Take hfj = xf{X'^V~^X)~^Xj and assume that a^ > 0. 
Establishing Theorem [T] requires the regularity conditions 

(i) < Dl< inf i<i<m Di < supi<j<„ Di < Du < oo; 
(ii) ma.xi<i<mhii = 0{m~^); and 
(iii) maxi<j<m Wj = 0(m~^). 

Condition (iii) requires a kind of homogeneity of the small areas, and in 
particular, it assumes there are not a few large areas that dominate the others in 



terms of the Wi. Conditions (i) and (ii) are similar to those of Prasad and Rao 



(1990) and are often assumed in the small area estimation literature. 



Before stating Theorem 1, we first present some lemmas whose proofs are 
provided in the supplementary material and are used in the proof of Theorem 1. 
The proof of Theorem 1 can be found in Appendix B. 

Lemma 1: Let r > be arbitrary. Then 

2r' 



E 



dei 



B 



(ii) E 



da?, 



sup„2>o 



0(1), and 



dm 



dial 



2^2 



2r 



0(1) 



Recall that u = 6 - XP ^ iV(0, V). The results below then follow. 
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Lemma 2: Let r > and assume niaxi<j<m xj(3 = 0(1)- Then 

We-XPW'^''' = Op{m'') and E \\\e - XpW^''] = 0{m''). 

Lemma 3: Let z ~ Np{0, S). For matrices Apxp and Bpxp, where B symmetric, 
we have 

(i) Cov{z^Az,z^Bz) = 2tr{AllBT.) 

(ii) Cov{z'^Az, [z^Bzf) = Atr{AT.BT.)tr{BT.) + 8tr{AT.BT,BT.). 

Lemma 4: E[ial - alf] = 2{m - p)-^ ET=ii'^l + Dif + 0{m-^). 

Theorem 1. // regularity conditions (i)-(iii) hold, then E[{6f^^^^ — Oi)^] = 
9ii{crl) + 92i{crl) + guic^l) + QAicrl) + o{m-^), where 

gu{al) = Bial 

g2i{crl) = Bfhfi 

gs,{al) = BfDr^Var{al) 

m mm 

94{crl) = Y^ wfBfVi - X] JZ WiWjBiBjhYj, 

i=l i=l j=l 

and where Var{dl) = 2{m - p)'"^ YJk=i{^l + ^kf + o{m'^). 

Remark 1: We note that the the MSE of the benchmarked EB estimator in The- 
orem 1 is always non-negative. It is clear that gij(c7^), g2i{a'^)^ and 53j(o"^) are 
non-negative. To establish the non-negativity of (74((T^), let q = {qi, . . . ,q^), 
where qi = WiBiV^ . We can write g^icr^) = q^ {L — Px)Qj where Pj = 
V-^/^X{X^V-^X)-^X'^V-^/^. Thus, giial) > 0, and hence, the MSE in The- 
orem 1 is always non-negative. 

4. Estimator of MSE Approximation 

We now obtain an estimator of the MSE approximation for the Fay-Herriot 
model (assuming normality). Theorem ^ shows that the expectation of the MSE 
estimator is correct up to 0{m~^). 

Lemma 5: Suppose that 

sup\h'{t)\=0{m-^) (4.1) 

for some interval T C M. //cj^, o"^ G T w.p. 1, then E[h{a'^)] = h{a'^) + o{m~^). 
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Proof. Consider the expansion h{a'^) = h{cj'^) + h'{a'^^){a'^ — a'^) for some o"*^ 
between a"^ and a^. Then u*^ G T a.s., and h'{a^) < sup^g^ |/i'(t)| a.s. as 
wen. This imphes E[h'iaf){al - al)] <^sup,eT \h'it)\E\al - al\ = 0{m-^l^) by 
equation (4.1 ) and since E\a1^ — al\ < E2 [(cj^ — a^)^]. Hence, if (4.1 ) holds, then 
E[h{al)] = h{al) + o{m-'). D 

Theorem 2. E[gu{al) + g2iial) + 2g3^ial) + giial)] = guial) + g2iial) + 
93i{(^l) + 9i{(^l) + o{m-^), where gu{al),g2i{crl),g3i{(Tl), and gi{al) are defined 
in Theorem 1. 



Proof. By Theorem A. 3 in |Prasad and Rao||l990[ ), E[gii{al)+g2i{al)+2g3i{al)] = 
9iii^u) + 92i{(yu) + 9^i{^'u) + o{m~^). In addition, we consider E[g4^{a'^)], where 
54(<t2) ^ ^m ^ ^2^2p. _ ^rn ^ ^™ ^ w,w,B,BjhY^ =: g,,{al) + g,2ial). We first 






show that the derivatives of (741 (ct^) and g'42(<T^) satisfy (4.1). Let T = [0,cxd). 
Consider 

5541(0-^) 




sup > uu B, 



^^>o 



' i=l 



0{m-^). 



It can be shown that 
{X'^V-'^Xy^Dl^. Observe that 

d9i2(crl) 



-BiB^Df-BfB,D7' and (X^V-^X)-^ < 



dal 



m rn 

SEE 



WiWj 



B,Dl'hY^\ + \B,Dl'h^\ 



+ BiBjxl{X'^V-^X)-^X'^V-'^X{X'^V-^X)-^Xi 

< 3m'^{ max Wi)^DV Bi{a'^ + Dij){ max hi) 

l<j<m l<i<.m 

< 3m^( max WifDj}Du{a-l + DL)~^{al + i:>(7)( max /ij) 

l<i<m l<i<m 

= 3m2( max WifDj}Du{l + DuDj}){ max /ij) = 0(m"^). 

l<i<m l<i<m 



This implies that sup 



d942{crl) 
da?; 



0{m ^). Since the derivatives of g'4i((T^) and 
542(0'^) satisfy (4.1), we know that E[gi{(T'^)] = (74(o-^) + o{m'~^). D 



5. Parametric Bootstrap Estimator of the MSE of the Benchmarked 
Empirical Bayes Estimator 



ESTIMATION OF MEAN SQUARED ERRORS 



In this section, we extend the methods of Butar and Lahiri (2003) to find 
a parametric bootstrap estimator of the MSE of the benchmarked EB estima- 
tor. Under the proposed model, the expectation of the proposed measure of 
uncertainty of the benchmarked EB estimator is correct up to order 0{m^^). 

To introduce the parametric bootstrap method, consider the model 



ind 



di \ul ^ N{xil3 + u*,Di 



ind 



ur^^N{0,ai). 



(5.1) 



Following Butar and Lahiri] ( |2003 ), we use the parametric bootstrap twice. We 
first use it to estimate fl'ij(o'^),52i(<7^)) and 54(0'^) by correcting the bias of 
9ii{^u) : 92i{(^u) ^ and (74((T^). We then use it again to estimate E[{9f^ 
53i(c^^)+o(m-i). 



3-B\2l 



Butar and Lahiri (2003) derived a parametric bootstrap estimator for the 



MSE of the EB estimator under the Fay and Herriot ( 1979 ) model. Using The- 
orem A.l of their paper, they show that the bootstrap estimator V^ is 



K- 



BOOT 



2[gu{al) + 92i{al)] - E^ [gu{K^) + 52.(^*2)] + E^iief* - ft 



(5.2) 

where E^, denotes the expectation computed with respect to the model given 
in (|5t[, and Of^* = (1 - Bi{crf))ei + Bi{af)xf$. Following their work, we 
propose a parametric bootstrap estimator of the MSE of the benchmarked EB 



estimator that is a simple extension of (5.2). 

We propose to estimate guicr'^) + g2i{<^u) + 9i{^u) by 

2[gu{&l) + g2i{al) + g^ial)] - E, [gii{af) + g2i{af) + gAicrf)] 

and then to estimate E[{9f^ - 0ff] by E^[{ef^* - Of^f]. Thus, our proposed 
estimator of MSE[^f ^^^i] is 

^^B-BOOT ^ 2[5i,(a2) + 52.(^2) + g,{al)] - E, [guiaf) + g2^{af) + g,{a*^)\ 

+ E,[{ef''*-efB)\ 

Theorem 3. ElV^^-^OOT^ ^ MSE[ef^^^] + o{m-^). 



REBECCA C. STEORTS AND MALAY GHOSH 



Proof. First, by Theorem A.l in Butar and Lahiri (2003), we note that 



E*[9ii{K^)] = giii^l) - gsii^l) + Op{m ^), 
E*[92i{K^)] = 92i{^l) + Op{m~^), and 

where g,,{al) = [Bi{al)]*D7^ (§, - xf${al)Y . Also, E4g,{af)] = g,{al) + 



Op{m ^), which follows along the lines of the proof of Theorem A. 2(b) of Datta 



and Lahiri (2000). Applying these results and our Theorem 2, we find 

^B-BOOT ^ ^^^(^2) ^ ^^.(^2) ^ g^.^^2j ^ ^^(^2) ^ ^^.(^2) ^ 0^(^-1). 



This implies that 

^[^B-BOOT] 



9ii{(^l) + 92^1) + 9zi{(yl) + 9i{(^l) + o{m ^] 



since E[gr,i{a'^)\ = g3i{o'^) + o{Tn ^) by Butar and Lahiri (2003), and by applying 
the results of Prasad and Rao (1990). D 



6. Two Applications 

In this section, we consider a data set and report on a simulation study in 
order to compare the performance of the estimator of the MSE of the bench- 
marked EB estimator and the parametric bootstrap estimator of the MSE of 
the benchmarked EB estimator. Tables and figures that result from this can be 
found in Appendix A. 

We consider data from the Small Area Income and Poverty Estimates (SAIPE) 
program at the U.S. Census Bureau, which produces model-based estimates of 
the number of poor school-aged children (5-17 years old) at the national, state, 
county, and district levels. The school district estimates are benchmarked to 
the state estimates by the Department of Education to allocate funds under the 
No Child Left Behind Act of 2001. Specifically, we consider year 1997. In the 
SAIPE program, the model-based state estimates are benchmarked to the na- 



tional school- aged poverty rate using the benchmarked estimator in (2.3). The 



number of poor school-aged children has been collected from the Annual So- 
cial and Economic Supplement (ASEC) of the Current Population Survey (CPS) 
from 1995 to 2004, while the American Community Survey (ACS) estimates have 
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been used since 2005. Additionally, the model-based county estimates are bench- 
marked to the model-based state estimates using the the benchmarked estimator 



in ( |2.3D . 

In the SAIPE program, the state model for poverty rates in school-aged 
children follows the basic Fay-Herriot (1979) framework where 6i = 9i + Cj and 
6i = xj (B + Ui. Here 9i is the true state level poverty rate, Oi is the direct survey 
estimate (from CPS ASEC), Cj is the sampling error term with assumed known 
variance Di > 0, Xi are the predictors, /3 is the unknown vector of regression 
coefficients, and n,, is the model error with unknown variance o"^. The explanatory 
variables in the model are the IRS income tax-based pseudo-estimate of the child 
poverty rate, IRS non-filer rate, food stamp rate, and the residual term from the 
regression of the 1990 Census estimated child poverty rate. We estimate (3 using 
the weighted least squares type estimator f5{a'^) = {X'V~'^X)~'^X'V~^6, and we 
estimate o"^ using the modified moment estimator a^ from Section 2. 

As shown in Table A.l, the estimated MSE of the EB estimator, inse{9f^), 
compared to the estimated MSE of the benchmarked EB estimator, Tase{9f^^^^), 
differs by the constant 54(0"^), 0.025. This constant is effectively the increase in 
MSE that we suffer from benchmarking, and we see that in this case it is small 
(compared to the values of the MSEs). Generally speaking, it is expected to be 
small since g^icr'^) = 0{'m~^). 

In Table A.l, we write mse and mse as the bootstrap estimates of the 
MSE of the EB estimator and the benchmarked EB estimator, respectively. As 
mentioned, we consider year 1997 for illustrative purposes. When we performed 
the bootstrapping, we resampled o"*^ 10, 000 times in order to calculate mse and 
mse . This is best understood through the concept behind our bootstrapping 
approach. Consider the behavior of gii{a'^), the only term that is 0(1). Ordinar- 
ily, giii^u) underestimates gii{cr'^), and E^:[gii{a'^)] underestimates gu^a'^). The 
basic idea is that we use the amount by which E^[gii{a'^)] underestimates gii{^u) 
as an approximation of the amount by which gii{^u) underestimates gii{a'^). 

We run into a problem with the 1997 data, where gii{(j^) is 0, since in this 
case E^[gii{a'^)] overestimates gii{a'^). Recall that 

y^B-BOOT ^ ^^^(^2) + {^^^(^2) _ E4gu{af)]} + 0{m-'). 
Since (?ij(o'^) is and is the dominating term of V^ , many of the estimated 
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MSEs of the benchmarked bootstrapped estimator (mse ) are negative. Also, 
observe this same behavior holds true for the bootstrapped estimator proposed by 



Butar and Lahiri (2003), which we denote by mse . Hence, we do not recommend 
using bootstrapping when (j^ is too close to zero because of the form of cj^. We 
also note that the MSE of the benchmarked EB estimator is always non-negative 
as explained in Remark 1 of Section 3. 

In the second example, we ran a simulation study, using the same covariates 
from the SAIPE dataset from 1997. We generated our data from the model 

e^l^i'- iV(0„A) (6.1) 

where Di comes from the SAIPE dataset. We first simulated 10,000 sets of 



values for 9i and di using (6.1). We then used each set of di values as the data 
and computed the EB and benchmarked EB estimators according to (2.3) and the 
EB formula given below it. In order to use EB, we took (3 = (—3, 0.5, 1, 1, 0.5)^ 
and a'^ = 5. 

In Figure |X]T| we compare the estimator of the theoretical MSE of the bench- 
marked EB estimator and the bootstrap estimator of the MSE of the bench- 
marked EB estimator with the true value, i.e., the average of the squared differ- 
ence between the estimator values and the true 6i, generated according to model 



(6.1). In the upper plot, we see that the estimator of the theoretical MSE of 
the benchmarked EB estimator overshoots the truth very slightly, which shows 
that our estimator is slightly conservative. We find the opposite behavior to be 
true of the bootstrap estimator of the MSE of the benchmarked Bayes estimator, 
meaning that it undershoots the truth slightly. 

In practice, it seems safer to use a MSE estimator that overestimates than 
one that underestimates, and hence, we recommend our proposed MSE estimator 
over the bootstrapped MSE estimator. Using the lower plot, we compared the 
theoretical Prasad Rao (PR) MSE estimator with the associated true value. We 
find the same behavior in the PR estimator as we did in our proposed theoretical 
MSE of the benchmarked EB estimator. The overshoot occurs in the terms that 
the estimators have in common, i.e., 5'ij(cr^); 521(0"^); and g3i{a'^). We see that 
for this particular simulation study where m is particularly large at 10,000, the 
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difference between the two MSEs is indistinguishable. 
7. Summary and Conclusion 

We have shown that the increase in MSE due to benchmarking under our 
modeling assumptions is quite small for the Fay-Herriot model, specifically 0{m^^). 
We have derived an asymptotically unbiased estimate of the MSE of the bench- 
marked EB estimator (EBLUP) under the same assumptions which is correct to 
order 0{m~^). We have derived a parametric bootstrap estimator of the bench- 



marked EB estimator based on work done by Butar and Lahiri ( 2003 ) . Further- 
more, we have illustrated our methodology for a data set for fixed m using U.S. 
Census data. Since our theoretical estimator of the MSE under benchmarking is 
guaranteed to be positive, we recommend it over the one derived by bootstrap- 
ping. We also performed a simulation study that suggests use of the theoretical 
estimator of the MSE under benchmarking. In closing, it is important to pursue 
further work for more complex models, and, in particular, when it is necessary 
to achieve multi-stage benchmarking. 
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Appendix A 

Table A.l: Table of estimates for 1997 



mse(^i) mse(^f'-^) mse(6'f'^^^) mse^ mse^ 



1 


25.16 


21.38 


21.56 


15.72 


1.38 


1.41 


0.02 


0.04 


2 


10.99 


14.94 


15.11 


10.44 


2.12 


2.14 


0.66 


0.68 


3 


23.35 


20.89 


21.06 


11.84 


1.68 


1.70 


0.00 


0.01 


4 


23.32 


22.18 


22.35 


13.85 


1.90 


1.92 


0.37 


0.38 


5 


23.55 


22.71 


22.88 


2.39 


5.92 


5.94 


1.12 


1.13 


6 


9.14 


13.12 


13.29 


6.38 


2.19 


2.22 


0.36 


0.38 


7 


10.34 


13.39 


13.56 


9.85 


2.08 


2.10 


0.39 


0.41 


8 


15.54 


13.06 


13.23 


17.56 


0.91 


0.94 


-0.47 


-0.45 


9 


35.85 


32.43 


32.60 


32.35 


4.92 


4.95 


3.49 


3.50 


10 


18.34 


19.59 


19.76 


3.70 


3.71 


3.74 


0.40 


0.41 


11 


23.52 


20.53 


20.70 


12.93 


1.16 


1.19 


-0.38 


-0.37 


12 


18.98 


13.72 


13.89 


20.87 


2.45 


2.48 


1.24 


1.26 


13 


17.56 


13.64 


13.82 


12.38 


1.70 


1.73 


0.23 


0.25 


14 


14.57 


15.72 


15.89 


3.56 


3.45 


3.47 


-0.06 


-0.05 


15 


11.07 


12.53 


12.70 


7.58 


1.84 


1.86 


-0.23 


-0.22 


16 


11.09 


11.21 


11.38 


8.49 


1.74 


1.76 


-0.24 


-0.22 


17 


11.01 


13.48 


13.65 


9.34 


1.61 


1.63 


-0.15 


-0.14 


18 


23.12 


20.78 


20.95 


13.98 


1.37 


1.40 


-0.12 


-0.11 


19 


21.08 


24.15 


24.32 


15.19 


1.80 


1.82 


0.40 


0.42 


20 


13.18 


12.44 


12.61 


13.63 


2.09 


2.11 


0.56 


0.57 


21 


9.90 


13.16 


13.33 


9.28 


1.65 


1.67 


-0.03 


-0.01 


22 


19.66 


14.38 


14.56 


7.66 


2.46 


2.48 


1.02 


1.04 


23 


13.78 


16.86 


17.03 


4.04 


3.11 


3.13 


0.38 


0.39 


24 


14.34 


10.11 


10.28 


9.91 


1.64 


1.67 


0.16 


0.17 


25 


20.58 


22.30 


22.47 


15.07 


2.42 


2.45 


0.97 


0.99 


26 


18.90 


15.11 


15.28 


15.24 


1.00 


1.03 


-0.37 


-0.35 


27 


17.00 


18.60 


18.77 


12.95 


1.37 


1.40 


-0.21 


-0.19 
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Table A.l: Table of estimates for 1997 (continued) 



28 


9.72 


9.62 


9.79 


7.18 


2.24 


2.26 


0.09 


0.10 


29 


14.06 


12.94 


13.12 


10.23 


1.71 


1.74 


-0.06 


-0.04 


30 


10.94 


6.72 


6.89 


11.35 


1.88 


1.91 


0.50 


0.52 


31 


14.66 


13.28 


13.45 


5.52 


2.48 


2.51 


-0.03 


-0.01 


32 


29.69 


24.44 


24.61 


13.18 


2.62 


2.65 


1.38 


1.40 


33 


23.76 


22.85 


23.02 


3.10 


4.76 


4.79 


0.94 


0.95 


34 


13.90 


16.58 


16.75 


5.70 


2.29 


2.31 


-0.01 


0.01 


35 


18.19 


13.64 


13.81 


11.92 


1.81 


1.84 


0.48 


0.50 


36 


13.91 


13.64 


13.81 


3.95 


3.07 


3.10 


-0.25 


-0.23 


37 


16.09 


21.50 


21.68 


11.14 


1.52 


1.54 


0.24 


0.26 


38 


12.60 


13.43 


13.60 


10.35 


2.53 


2.56 


0.83 


0.84 


39 


14.61 


13.92 


14.09 


3.73 


3.40 


3.42 


-0.01 


0.00 


40 


20.37 


14.60 


14.77 


18.53 


1.04 


1.07 


-0.15 


-0.14 


41 


18.74 


21.21 


21.38 


14.57 


1.49 


1.52 


0.02 


0.04 


42 


12.87 


15.77 


15.94 


12.94 


1.98 


2.01 


0.46 


0.47 


43 


16.09 


16.10 


16.27 


11.94 


1.92 


1.95 


0.28 


0.30 


44 


21.95 


21.38 


21.55 


3.38 


4.05 


4.07 


0.38 


0.40 


45 


11.27 


9.76 


9.93 


9.45 


2.28 


2.31 


0.50 


0.51 


46 


11.15 


10.10 


10.27 


11.95 


2.45 


2.48 


0.86 


0.88 


47 


16.40 


14.96 


15.13 


11.51 


1.20 


1.22 


-0.49 


-0.47 


48 


12.26 


13.17 


13.34 


9.33 


1.85 


1.87 


0.01 


0.02 


49 


18.76 


22.25 


22.42 


13.73 


3.81 


3.83 


2.46 


2.48 


50 


7.60 


11.87 


12.04 


6.41 


2.74 


2.76 


0.97 


0.98 


51 


11.74 


11.70 


11.87 


8.86 


2.08 


2.10 


0.17 


0.19 
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Figure A.l: Comparing Simulated MSEs with True MSEs 
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Appendix B 

Proof of Theorem 1. Observe that 

E[{9fBM^ - e,f] = E[{ef - ^0'] + E[{efB^'^ - off] 
= E[{df-e,f] + E[{9f-ef^-t + 



^EB\2] 



^B 



)^,-t?\ 



= E[{9f - e,f] + Enef^ - eff] + e[{§S - tf] + e[{§S^ - 1 

- 2E[{9f^ - 9f){§^^ - ~e^)] - 2E[{9f^ - ef){l^ - t)] 

+ 2E{{l^^-~e^){l^-t)]. (B.l) 

Next, observe that E[{ef -ei)^]+E[efB -Of]'' = g^,{al)+g2i{al)+g:,i{cjl) + 



B\2i 



oim ^), by Prasad and Rao (1990), where 



gii{al) = Biul 

g2i{crl) = BfhYi 

g^,{al) = BfD7^Vav{al). 

It may be noted that while 5ii(<T^) = 0(1), both g2i{cru) ^^d gsiicr'^) are of order 
0{m~^), as shown in Prasad and Rao (1990). We show that E[{6^ — i)"^] = 



54(0"^) = 0{m ^), whereas the remaining four terms of expression (B.l) are of 
order o{7n~^). 

First, we show that S[(^^-t)2] = 34(0-2). We write ^^-t = _^m^^.^.(^._ 
xj(3) and consider 



Em 



s-tf] 



E <! J^^,5,(^,-a;f;3)l 

Y^wfBfEm - xj$f] + Y,w^WjBiBjE[{e, " xjp){ej - x/p)] 

m 
Y, wfBfiV, - hi) + J^ w^w,B,Bj{-hY^) 

(B.2) 



m 



iy^i 



y w^ B,i Vi — 2^ 2_^ WiWjBiBjh. 

i=l i=l j=l 



16 



REBECCA C. STEORTS AND MALAY GHOSH 



Note that the expression on the right hand side of (B.2) is 0(171 ) since maxi<j<m hi 

^ \ \/\/ I ■ I I ■ I I ■ I I I ■ ■ I |-*>i I. ■■>-■■. Ill>-| XT .^ ^- ^^ ^ ^^ I ,, . . 



0{m ^), which imphes that niaxi<j<j<m /i,- 



0{m-^] 



Next, we return to (B.l) and show that E[{6. 



EB 
w 



9sr] 



o[m 



-1- 



that 



0B\2] 



3EB 



35 



m— 1 m 



m—1 m 
i=l j=i+l 



0EB _ ef){efB - ef) + o{m~^), 



(0:. 



EB 



. Consider 
(B.3) 



9f] 



since J^i wfE[{ef^ -Off] = o{m-^). The latter holds because E[{ef^ -Off] = 
g2i{crl) + 93i{cr'i) = 0{m~^), maxi<j<m Wj = 0{m~'^), and J2i^i = 1- Thus, it 

o{m^^) for all i 7^ j, and we do so 



suffices to show E 



^EB 



BmoEB 



w 



9f] 



80 do (rr ) 

by expanding Of^ about Of. For simplicity of notation, denote -^-\ = — ' „" 

oaf, daf, 



and S$. = ^tm. Then 



S{<Ti) 



2\2 



dial 



^EB 



2^2 



gS 



^^' '%2 „2n , ^ ";;^_'^j* /-2 



da?, 



i^l-^D + ^^^^^i'j:. 



2\2 



2 dial) 



2\2 ^ u 



a„ 



for some o"*^ between a"^ and o"^. The expansion of Of about Of is similar. 
Consider E[{Of^ - Of){Of^ - Of)] for i / j. Notice that 



^[(^1 



EB 



^f)ior 



9f) 



E 



8£r2 da?, ^ " "' 



2 



dal d{alY 



{K - < 



2\3 



+ '-E 
2 



d'eg dof 3 



diKY dal 

Rq + i?i + i?2 + -R3 



-i- 



g^^g 9'^f* ,,2 . 



2\4 



In i?i 



£; 



daldiaiy^""- 



2\3 



^^) 



£; 



s 



' aoB f)2aB 

aa2 9(^2)2 ^""^ """^ ^'^^ ^ ^^ 

5^2 9(^2)2 



{airi{ai < 0) 



(B.4) 



ESTIMATION OF MEAN SQUARED ERRORS 



17 



Observe that 

E 



^^^^^^(.^f.(.S.o) 



dald{alY' 



<<E-^ 



< atE-4 



89} 



del 



1 



Ei 



2^2 



dicrl 



PHal<0) 



sup ■ 



2\2 



dial 



PHai<0) 



o{m ^) 



for all r > by Lemmas [I] (ii) and pi which we have proved in Appendix A. 



Also, -P(ct2 < 0) = 0{m *") V r > 0, as proved in Lemma A. 6 of Prasad and Rao 
p^90| ). Now 



E 



def dH% 

dal d{alY 



~ai - aifliai > 0) 



E 



-E 






^ ^ V (^2_^2^3J(~2<o^ 



(B.5) 



where the second term expression in (B.5 ) is 0{m *") since P{al < 0) = 0{m *") V r > 
0. We next observe that 



E 



daldial)^^''^ "^ 



< Ei 



< Ei 



0{m 




Ei 



Ei 



2\2 



d{<yl 



E--[{al-aif] 



sup 



dHf 

2\2 



d{<yl 



En{al-aif 



since E[[a^ — a^) ] = 0{m ') for any r > 1 by Lemma A. 5 in Prasad and 
Rao (1990). This proves that Ri = o{m^^) since maxi<j<mU'j = 0{m~^). By 



symmetry, R2 is also o{m ^). Finally, we show that R3 is o{m ^). Using a similar 
calculation involving Ri, we can show that 



E 



2oB 



{dal^d^ial) 



2\4 



^i^l-O 



E 



,2nB 



{daiyo^aiy 



(< - '^u 



2\4 



+ o{m ''). 



(B.6) 
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Observe now that 



E 



p)2nB B'^ftB 



2\A 



< Ei 



< Ei 






0{m 



E-i 







e\ 



( d^ef 




2^2 
u 



Plugging this back into (B.6), we find that E ,„ o"o „o . i.r, (cfI — cr, 

[(9^2)2 02(^2)2 

o{m^^). Hence, R3 is o(m~^). Finally, by calculations similar to those used for 



d'^9^ d^e 



2qB 

j* 



2\4 



(B.4), we find that 



Rn 



E 



da?, da?. ^ " "^ 



E 



deldel 

da? da? ^"^^ 



2\2 



+ o(m ^). 



Take S = V-X{X^V-^X)-^X^ = {I-P^)V, where Px = X{X'^V-^X)-^X^ , 
= X{X^V-^X)-^X^V-^, and 
d§P 



write P^ = X{X'^V''^ Xy^ X^V'^ , and let e^ be the ith unit vector. We can 



show 



dcrl 



Bid' T.V-^u, where u = O-Xp. Define Ai 



BiBjV'^i:eiej^T.V-^a.nd 



consider 



E 



deldJl 

dal dal ^^" 



a. 



2\2 



E{u' Menial- aiY\ 



2\2 



_2\2i 



Cov(ii^ MjU, {ai - aiY) + E[u' Ai,u]E[{ai - aiY\ 
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Using Lemma p^ and the relation (/ — Px)^ — {^ ~ Px)y ■, 

= {m- pr^Cov{u^AijU, [u^{I - Px)u - tr{(/ - Px)V}f) (B.7) 
= {m - p)-^ Cow {u^AijU, [u^{I - Px)uf) 

- 2(m - p)-^ Co-v {u^ AijU, u^{I - Px)u)tr{{I - Px)V} 

= {m-p)-^Utv{AijV{I - Px)V}tT{{I - Px)V} 
+ 8tv{AijV{I - Px)V{I - Px)V} 

- Aiv{AijV{I - Px)V]iT{{I - Px)V]\ 

= 8{m - p)~hv{AijV{I - Px)V{I - Px)V}. 

= 8(m - p)-^BiBjej^^V-^{I - Px)V{I - Px)^"^Sei, 

where tr denotes the trace. Observe that (/ — Px)V~^T, = I — (Px)'^ a-nd 
(/ - P^)F(I - (P^f) = S. Then 

Cov{u^AijU, {dl - alf) = 8im-p)-^B,Bje/^V~\l - Px)V{I - Px)V-^T.ei 

= 8(m - p)-'B,Bje/iI - pI)V{I - {P^f)ei 
= 8{m - py^BiBje/T,ei 
= 8{m-p)-'^BiBje/Vei + 0(m"^) = 0(m"^), 

since the first term is zero because i ^ j and V is diagonal. We now calculate 

E[u^Aiju] = tr{B,BjV-'^T.eiej'^T,V-'^T.} = BiBjEj^T.V-^T.V-'^T.ei. 

Observe that T.V'^T. = 1 - (P^)^ - Px + Pxi^xf- Then, after some compu- 
tations, we find that E[u Aiju] = BiBjEj V^^Si + 0{m~^) = 0{'m~^) since 
i / j. By Lemmag ^[(^^ _^2)2] ^ 2{m-p)-^ Ek=Ml + Dk)^ + 0{m-^). Then 

E[u^A,ju]E[{al - alf] = oim-'), 

since i ^ j. This implies that Rq = o{m~^), which in turn implies that 

E[{Of^ - Of){Of^ - Of)] = o{m-^) for i / j, (B.8) 

since Rq, Ri, R2, and R3 are all o{m^^). Finally, this and (B.3) establishes that 
E[{§S^-~eSr]=o{m-'). 
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We return to (|bjJ) to show that E[{ef^ - ef){9^^ - 6^)] = o(m-i). By the 
Cauchy-Schwarz inequahty, we find that 

E[{ef^ - ef)CeS^ - 'e^)] < eI [{ef^ - eff] eI [{§^^ - ~e^f] = o{m-% 

since the first term is 0{m^^''^) and the second term is o{m^^''^). 

For the next term of (B.l), we are interested in showing that E[{9, 
6f){9^ — t)] = o{m~^). First, by Taylor expansion, we find that 

1 dHP. 



EB 



ptoB 
^EB aB _ ^^i (%.2 J2\ , 



-2 _ 2\2 
2\1 \ u ^u) 



for some a'^ between a'^ and a^. Consider that 6^ — t = — ^^ WiBi{9i — xj (3). 
Then 

'd9f 



E[{ef^-ef){B^-t)] = -Y,^,B,E 

j 






0{al 






2\2 



:— R4 + R5. 



Observe that 



E 



T , 



Q^iK-<m-xj'f3) 



-alE 



pjoB 

^{e,-xj''p)i{al<o) 



(B.9) 



E 



E 



E 



-E 



E 



fjoB 

^{al-al){e,-x/m^l>0) 
00 ^ 



-^K-ai){e,-x/f3)I{ai>0) 

fioB 

Ai^i-'jim-x^-p) 

PjoB 

^{al-al){e,-x/mal<0) 

fjoB 

^i^l-'ylm-x^-p) 



+ o(rn ^) 



+ o{m '') 



+ o[m 



since we may observe that E 



89, 



B 



^ial)ie,-x/p)lial<0) 



o{m "') and 
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E 



9^ff.2 



^ial-al){e,-xfP)I{al<0) 



B 



o{m ''). Now, note that 



oaf, 



and write Dij = BiV "^HeiSj^. Then by calculations similar to those in (B.7), 
we find 



E 



f)f)B 



Cov{vF DijU,al-al) 

(m - p)-^Cov{u^DijU, u^{I - Px)u - tr{(/ - Px)V]) 

2{m-p)-hT{D,,V{I-Px)V} 

2(m - p)-hT{B,V-^lleie/V{I - Px)V} 

2{m - p)-^Bie/V{I - Px)V-^T.ei 

2(m - p)-^Bie/V{I - {P^Y)ei 

2{m-p)-^Bi[e/Vei-hYj] 

2{m - p)'^Bie/Vei + o{m''^). 



With this, we find that 



Y,^,BjE 



^^f i.2 2 



^(<-<)(^,-^i'/3) 



2{rn-p)-^Blwi{al^Di)^o{rn'^) = o{rn-^) 



Hence, i?4 is o{m ^). We now show that -R5 = o(m ^). By calculations similar 



to those in (B.9), 



Y,w,B,E 



d{a: 
Y^WjB.E 



^.i^l-crim-x,^^) 



^ ^i* ^-2 ^2\2 



d{c7l 



2A2 



ai-aiY{e,-Xj'P) 



+ olm ). 
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Recall that E 
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2' 



Y^^jBjE 



" (al - al)\e, - x/ fi) 



0{m-^) by (B.2). Now note that 



^ ^i* (zl2 _2N2/fl ^ T 



< Ei 



< Ei 






e'^ [{al - alf] e\ 



Y,w,Bji§,-x/p) 



sup 



d^f 






^^ m - ^If] e'^ 



Y.wjB.iOj-Xj'^^) 



by Lemma mil), by Theorem A. 5 of Prasad and Rao (1990), and by expres- 
sion (|r2[. Thus, i?5 is o(m-i), and E[{9f^ - ef){§^ - t)] = o{m-^). 

For the last term in (B.l), we use the the Cauchy-Schwartz inequality to 



show 



^[(^r - ~OS){k - t)] < E^^ [(^r - ~0^?]E^^ [{k - tf] = o{ 
This concludes the proof of the theorem. 
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